sequence of the polypeptide encoded in the extended ORF in the derived sequence.

Fig. 27 shows the sequence of the HCV cDNA in clone 12f, the segment which overlaps clone 14i, and
the amino acids encoded therein.

Fig. 28 shows the sequence of the HCV cDNA in clone 35f, the segment which overlaps clone 39c, and
the amino acids encoded therein.

Fig. 29 shows the sequence of the HCV cDNA in clone 19g, the segment which overlaps clone 35f, and 
the amino acids encoded therein. 

Fig. 30 shows the sequence of clone 26g, the segment which overlaps clone 19g, and the amino acids 
encoded therein. 

Fig. 31 shows the sequence of clone 15e, the segment which overlaps clone 26g, and the amino acids
encoded therein. 

Fig. 32 shows the sequence in a composite cDNA, which was derived by aligning clones 12f through
15e in the 5' to 3' direction; it also shows the amino acids encoded in the continuous ORF.

Fig. 33 shows a photograph of Western blots of a fusion protein, SOD-NANB5-1-1, with serum from
chimpanzees infected with BB-NANB, HAV, and HBV.

Fig. 34 shows a photograph of Western blots of a fusion protein, SOD-NANB5-1-1, with serum from
humans infected with NANBV, HAV, HBV, and from control humans.

Fig. 35 is a map showing the significant features of the vector pAB24. 

Fig. 36 shows the putative amino acid sequence of the carboxy-terminus of the fusion polypeptide
C100-3 and the nucleotide sequence encoding it.

Fig. 37A is a photograph of a Coomassie blue-stained polyacrylamide gel which identifies C100-3
expressed in yeast. 

Fig. 37B shows a Western blot of C100-3 with serum from a NANBV-infected human.

Fig. 38 shows an autoradiograph of a Northern blot of RNA isolated from the liver of a BB-NANBV
infected chimpanzee, probed with BB-NANBV cDNA of clone 81.

Fig. 39 shows an autoradiograph of NANBV nucleic acid treated with RNase A or DNase I, and probed 
with BB-NANBV cDNA of clone 81.

Fig. 40 shows an autoradiograph of nucleic acids extracted from NANBV particles captured from
infected plasma with anti-NANB5-1-1, and probed with 32P-labeled NANBV cDNA from clone 81.

Fig. 41a and b show autoradiographs of filters containing isolated NANBV nucleic acids, probed with
32P-labeled plus and minus strand DNA probes derived from NANBV cDNA in clone 81.

Fig. 42 shows the homologies between a polypeptide encoded in HCV cDNA and an NS protein from
Dengue flavivirus. 

Fig. 43 shows a histogram of the distribution of HCV infection in random samples, as determined by an
ELISA screening.

Fig. 44 shows a histogram of the distribution of HCV infection in random samples using two 
configurations of immunoglobulin-enzyme conjugate in an ELISA assay. 

Fig. 45 shows the sequences in a primer mix, derived from a conserved sequence in NS1 of 
flaviviruses.

Fig. 46 shows the HCV cDNA sequence in clone k9-1, the segment which overlaps the cDNA in Fig. 27,
and the amino acids encoded therein. 

Fig. 47 shows the sequence in a composite cDNA which was derived by aligning clones k9-1 through
15e in the 5' to 3' direction; it also shows the amino acids encoded in the continuous ORF.
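
The composite sequences of Figs. 32 and 47 are described as derived by aligning overlapping clones in the 5' to 3' direction. The following is only a minimal sketch of that idea, assuming exact suffix/prefix overlaps between consecutive clones. The function names `merge_overlap` and `build_composite` and the toy sequences are hypothetical illustrations, not the clone sequences or the procedure actually used.

```python
def merge_overlap(left: str, right: str, min_overlap: int = 4) -> str:
    """Join two sequences on their longest exact suffix/prefix overlap."""
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(left), len(right))
    # Try the longest candidate overlap first, then shorter ones.
    for size in range(longest_possible, min_overlap - 1, -1):
        if left[-size:] == right[:size]:
            return left + right[size:]
    raise ValueError("no overlap of the required length found")


def build_composite(clones: list[str], min_overlap: int = 4) -> str:
    """Merge clones, given in 5' to 3' order, into one composite sequence."""
    composite = clones[0]
    for clone in clones[1:]:
        composite = merge_overlap(composite, clone, min_overlap)
    return composite


# Toy sequences (assumed for illustration only).
toy_clones = ["ATGGCTTACG", "TTACGGATCC", "GATCCTGA"]
composite = build_composite(toy_clones)
print(composite)
```

Real clone overlaps may contain sequencing differences, so an approximate alignment would be needed in practice; exact matching is used here only to keep the sketch short.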

I. Definitions

The term "hepatitis C virus" has been reserved by workers in the field for a heretofore unknown
etiologic agent of NANBH. Accordingly, as used herein, "hepatitis C virus" (HCV) refers to an agent
causative of NANBH, which agent is a virus characterised by: (i) a positive stranded RNA genome; (ii) said
genome comprising an open reading frame (ORF) encoding a polyprotein; and (iii) the portion of said
polyprotein corresponding to Figure 14 having at least 40% homology to the amino acid sequence in Figure
14. This agent was formerly referred to as NANBV and/or BB-NANBV. The terms HCV, NANBV, and BB-NANBV
are used interchangeably herein, but all refer to the virus as defined above. As an extension of this
terminology, the disease caused by HCV, formerly called NANB hepatitis (NANBH), is called hepatitis C.
The terms NANBH and hepatitis C may be used interchangeably herein.
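
Criterion (iii) above turns on a percentage homology between two amino acid sequences. The text does not state how that percentage is computed, so the following is only a sketch under an assumption: homology is taken as percent identity over the ungapped columns of two sequences that have already been aligned to equal length. The function name `percent_identity` and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == "-" or residue_b == "-":
            continue  # gap columns are excluded from the comparison
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * matches / compared


# Toy one-letter sequences (assumed for illustration only).
reference = "ACDEFGHIKL"
candidate = "ACDQFGHI-L"
identity = percent_identity(reference, candidate)
meets_threshold = identity >= 40.0  # the 40% figure comes from criterion (iii)
print(round(identity, 1), meets_threshold)
```

Other conventions, such as counting gaps as mismatches or scoring conservative substitutions, would give different percentages; the choice made here is an assumption.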

The term "HCV", as used herein, denotes a viral species which causes NANBH, and attenuated strains 
or defective interfering particles derived therefrom. As shown infra, the HCV genome is comprised of RNA.
It is known that RNA-containing viruses have relatively high rates of spontaneous mutation, i.e., reportedly
FIG. 32-1 COMBINED ORF OF DNAs 12f through 15e
.TAAAT^AGTCCTACATGCAttCTCCCC^ 



1 CCATA' 
GGTA' 



61 AC 



TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu
: :^^/^^^rvT«rrira arr^Tftfi ArrTT<^TCCCTGTCCAGGCTCGAGTCGGGCA 



ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA

I«uI/MiThrThrThrGlnTrpGl^^^ 

SerThrGlyLeuIleHisLeuHisGlnAsnIleValAspValGlnTyrLeuTyrGlyVal
TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG

wggtSgSt^gtcc^^ 



181 

ACA( 




301 

ACGAAi 



LeuAlaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGlu
TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGA 

ACGAACGTCTCCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

AlaAlaLeuGluAsnLeuValIleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu

361 AGGCGGCTO1X3<»GAACCTO^ 

TCCGCCGAAACCTCTTGGAGCATTATGAATTAOGTCGTAGGGAC06GCCCTGCGTGCCAG 

ValSerPheLeuValPhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGly
421 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGG 

AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGC 

MaValTyxThrPheTyrGlyMetTrpProLeuI^u^^ 
481 GAGOGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTO 

CTOGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACX3GCAAOGGGG 

Ai^AlaTyrAlaXeuAspThiCluValAlaMaSerCysGlyGlyValValLeuValGly 
541 AGCGGGCGTACGCGCTGGACACGGAGGTGGCCGCGTCGTGTGG^TC 

TCGca:GCAraxxx;ACCTGTGCCTCCA^ 

LeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyrIleSerTrpCysLeuTrpTrp
601 ggttgatggcgctgactctctcao^tattacam 

CCAACTACCGCGACTGAGACy^TGGTATAATGTTCGCGATATAGTCGACCACGAACACCA 

LeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsn

661 GGCTTCAGTATTTTCTCACCAGAGTGGAAGCGCAACTCCA^ 

CCGAAGTCATAAAAGACKMTCTCACCTTCGCGTTGACGTGCACACCT 

ValArgGlyGlyArgAspAlaValIleLeuLeuMetCysAlaValHisProThrLeuVal
721 ACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGG
TGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACC

PheAspIleThrLysLeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAlaSer

781 TATTTCACATCACCAAATTCCTGCTGGCCGTCTTCGGACCCCTT^ 

. . . nm ^m.^m^/~mfT«ii* »iviirc>nrrar &RAArOTrKra?GAAA(X^AAGAAGTTCGGT 



AT, 

841 



901 



ACGTC(XjAGGGGGGCGCGAOGCCGTOU!CTTACTCATG 

TGCAGGCTCCCCCCG0GCTG0GGCAGTAGAATGAGTACACACGACATGTGGGCTGA6ACC 

PheAspIleThrLysLeuLeuLeuMaValPheGlyPro^TrpIleLeuGlnAlaS 
lTTTCACAT<^CCAAATTCCTGCTGGCCGTCTTCGGACCCCTTTX3GA 
'AAACTGTAGTGGTTTAACGACGACCXMCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

I^uI^uLysValProTyrPheValArgValGlnGlyLeuLeuArgPh^^3^I«uA: 
GTTTGCTTAAAGTACCCTACTTTGTGOGCGTCCAAGGCCTTCTCCGGTTCTGOGCGTTAG 
CAAACGAATTTCATGGGATGAAACACGCGCAGGITCCGGAAGAGGCCAAGAC^^ 

ArgLysMetlleGlyGlyHisTyrValGlnMetValllelleLys^lyJU^I^TI 
CGCGGAAGATGATC(MA(^CCATTACGTGCAAATGGTCATCATTAMTO 
GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAAT 
GlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArg

961 CTGGCACCTATGTMATAACCATCTCA 

GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCC^ 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTXX^CAAATGGAGAC 
CTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTO 

TrpGlyAlaAspThrAlaAlaCysGlyAspIleIleAsnGlyLeuProValSerAlaArg
1081 CGTGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAAO^ 
GCACCCCCOGTCTATGGCGGCGCAOGCCA^ 

ArgGlyAr^luIleLeuIjeuGlyProAlaAspGl^ 
1141 GCAGGGGCCX3GGAGATACTGCTOGGGCCAGCCX5AT(3GAAT^ 

CX5TCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTC 

LeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIleIleThr
1201 TGCTGGCGCCCATCATOGCGTACGCCCAGCAGACAAGGGGCCTCCT 
ACGACCGCXXK3TAGTGCCGCATGCGGGTCGTCTC 

SerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnIleValSerThrAla
1261 CCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTG 
GGTCGGATTGACCGGCCCTGTTTTT^ 

AlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrpThrValTyrHisGlyAla
1321 CTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCT 
GACGGGTTTGGAAGGACOGTTGCACGTAGTTACCCCACACGACOT 

GlyThrArgThrIleAlaSerProLysGlyProValIleGlnMetTyrThrAsnValAsp
1381 CC^GAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCC^ 

GGCCTTGCTCCTGGTAGCGCAG TGGGTTCCCAGG ACAGTAGGTCTACATATGGTTACATC 

GlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeuThrProCysThrCys
1441 ACCAAGACCTTCTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCAT^ 

TGGTTCTGGAACACC0GAC0GGGCX3AGGCGTTCXIATCGGCGAGTAACTGT^ 

GlySerSerAspLeuTyrLeuValThrArgHisAlaAspValIleProValArgArgArg
1501 GCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGC^ 

CGCOGAGGAGCCTGGAAATGGACCAGTGCTCCGTGOGGCTACAGTAAGGGCAOGCGGCCG 

GlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSer 
1561 GGGGTGATAGCAGGGGCAGCCTGCTCTOGCCCCGGCCCATTTCCTACl^ 

CCCCACTATCGTCCC0GTCGGA0GACAG<X3GGGCCGGGTAAAGGATGAACTTTCCGAG^ 

GlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyIlePheArgAlaAlaValCys
1621 cggggggtccgctgttgtgcrccgcggggcacgccgtgggcatat^ 
gcxxx:ccaggogac^cacggggcxk:cccgtgcggcacccgtata^ 

ThrArgGlyValAlaLysAlaValAspPheIleProValGluAsnLeuGluThrThrMet

1681 gcacctotggagtggctaaggcggtcgactttatccctgtggagaac^ 
cgtgggcacctcaco^ttcxxsc 

ArgSerProValPheThrAspAsnSerSerProProValValProGlnSerPheGlnVal 
1741 TGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTA^

ACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTC 

AlaHisLeuHisAlaProThrGlySerGlyLysSerThrLysValProAlaAlaTyr
1801 TGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTC
AOOGAGTGGAGGTAOGAGGGTGTCCGTCGCCGTTTTCGTGGTTCC^ 

MaGlnGlyTyrLysValLeuValLeuAsnPro^^ 
1861 CAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTGCAA 
GTCGAGTCCCGATATT(XACX;AT(^TGAGTTGGGGAGACAAC^ 

AlaTyrMetSerLysAlaHisGlyIleAspProAsnIleArgThrGlyValArgThrIle

FIG. 32-2 
1921 GTGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAA
CACG AATGTACAGG TTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCC^ 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGl 
1981 TTACX*CTGGCAGCCCCATCACGTA^ 

AATGGTGACCGTCGGGGTAGTCCATGAGGTGGATGCCGTTCAAGGAACGGC^ 

SerGlyGlyAlaTyrAspIleIleIleCysAspGluCysHisSerThrAspAlaThrSer
2041 GCTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTC 
CGAGCCCCCCXSCGAATACTGTATTATTAAACACTGCTC^ 

IleLeuGlyIleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal
2101 CCATCTTGGGCATCX3GCACTGTCXriTCA<XAAGCAG 

GGTAGAACCCGTAGCCGTGACAGGAACTGGTTCSTCTC 

LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnIleGluGluVal
2161 OTCTCGCC^CCGCCACCCCTCCGGGCTC^ 

ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGG^ 

MaLeuSerThrThrGlyGluIleProPheTyrG^ 
2221 TTGCTCTGTCCA(XlACCGGAGAGAltXXITTTTTACGGCA 

AACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGA 

LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla
2281 TCAAGGGGGGGAGACATCTCATCTTCTCTCATTCAAAGAAGAAGTC 

AGTTCCCCCCCTCTGTAGAGTAGAAGACAG TAAGTTTCTTCTT(^CGCTGCTTG AGCGGC 

LysLeuValAlaLeuGlyIleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal
2341 CAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTC
GTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATC 

IleProThrSerGlyAspValValValValAla^ 
2401 TCAOXXJOGACCAGCGGCGATGTTCTOGTCG^ 

AGTAGGGCTGGTOGCCGCTACAACAGCAGCACCGTTGGCT 

GlyAspPheAspSerValIleAspCysAsnThrCysValThrGlnThrValAspPheSer
2461 CCGGCGACTTCGACTCGGTGATAGACTGCAATACGTCTGTC^
GGCCGCTGAAGCTGAGCCACTATCTCAO^^ 

LeuAspProThrPheThrlleGluT^ 
2521 GCCTTGACCCTA(XTTCACCATTGAGACAATCACGCTCCCCCA^ 
CGGAACTGGGAIX^AAGTGGTAACTCTGTTAGTGOGAGG 

GlnArgArgGlyArgThrGlyArgGlyLysProGlyIleTyrArgPheValAlaPro
2581 CTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCT 
GAGTTGCAGOXXXSTCCTGACeGT^ 

GluAr^ProSerGlyMetPheAspSerSerVall^^ 
2641 GGGAGCGCCOCTCCGGCATGTTCX5ACT(X3TCCGTCCTCT^ 
COCTOGOGGGGAGGCOGTACAAGCTGAGCAGGCAGGAG 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThr
2701 GTGCTTGGTATGAGCTCAOGCCOGCOGAGACTACAGTTAGGCTACGAGCCT 
CACXjAACCATACTCGAGTGOQGGOGGCTCTGATGTCAATCOGATGC^ 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeu
2761 CCCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTT^^ 
GGGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACC^ 

ThrHisIleAspAlaHisPheLeuSerGlnThrLy^ 
2821 TCACTCATATAGATGOCCACTTTCTATCCCAGACAAAGCAGAGTGG 
AGTCAGTATATCTAOGGGTCAAAGATAGGGTCrcm 

LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp
2881 ACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAA6CCCCTC 
TGGACCATCGCATGGTT0GGTGGCACA0GCGATCXX3GAGTT^ 

FIG. 32-3 
GlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeu

2941 ACCAGATGTGGAAGTGTTTGATTC^

TGGTCTACACCTTCACAAACTAAGCGGAGTTCG^ 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrIle

3001 TATACAGACTGGGCGCTGTTCAGAATGAAATCA^ 

ATATGTCTGACCCGCGACAAGTCTTACTTTAGTG^ 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGly

3061 TCATGACATGCATGTCGGCCGACCTGGA

AGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCT^ 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysValValIleValGlyArg

3121 GCGTCCTGGCTGCTTTGGCOGCGTATTGCCTGTCM 

CGCAGGACCGACXJAAACCGGCGCATAAOGGACAGTTGTC 

ValValLeuSerGlyLysProAlaIleIleProAspArgGluValLeuTyrArgGluPhe
3181 GGGTCGTCTTGTCCX3GGAAGCCGGCAATCATA(XTGACAG^ 
CCCAGCAGAACAGGCCCTTCGGCX^CTAGTATGGACTGTC^ 

AspGluMetGluGluCysSei^lnHisI^^ 
3241 TCGATGAGATCGAAGAGTGCTCTCAGCACTTACOGTACATCGAGCAAGGGAI^^ 
AGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCT 

GluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal 
3301 CCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGOT 
GGCTCGTCAAGTTCGTCTTCCX5GGAGOCGGAGGACGTC 

IleAlaProAlaValGlnThrAsnTrpGl^ 
3361 TTATCGCCCCTGCTCTCCAGACCAACTGGCAAAAACTCGAGA 
AATAGCGGGGAOGACAGGTCTCGTTGACCGTTTTTGA^ 

TrpAsnPheIleSerGlyIleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro
3421 TCTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTC 

ACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGG

AlaIleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln

3481 CCGOCATIX^TTCATTGATGGCTTTTACAGCTGCTGTCACCAGCC 

GGCGGTAACGAAGTAACTACCGAAAATGTCGAOGACAGTGGTCGGGTGATTGGTGATOGG 

ThrLeuLeuPheAsnll^uGl^ 
3541 AAACCXrrCCKrTTCAACATATTGGGGGGGTGG^
TTTGGGAGGAGAAGTTGTATAACCCCCC^ 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlaIleGlySerValGlyLeuGly

3601 CCGCTACTGCCTTTGTGGGOGCTGGCTTAGCTGGC^CGCCA 
GGCGATGACGGAAACACXXXXX5AC(X3AATCGACCGC^ 

LysValLeuIleAspIleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAla 
3661 GGAAGGTCCTCATAGACATCXITTGCAGGGTATGGOGCGGGCGTGGCGGGAG 
CCTTKX^GGAGTATCTGTAGGAACETC^ 

PheLysIleMetSerGlyGluValProSerThrGluAspLeuValAsnLeuLeuPro
3721 CATTCAAGATCATGAGCGGTGAGGTCCCCTC^ 

GTAAGTTCTAGTACTOGCCACTCCAGGGGAGGTGCCTC 

IleLeuSerProGlyAlaLeuValValGlyValValCysAlaAlaIleLeuArgArgHis
3781 CCATCCTCTOGCXCGGAGCCX^rCGTAGTCGGOGTGGTCTGTGCAGCAATACTGCGCOGGC 
GGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATC 

ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 
3841 ACGTTGGCCCX^CGAGGGGGC^TGCAGTGGATGAACCGGCTGATAGC 
TGCAACCGGGCCCGCTCCXICOT 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 

FIG. 32-4 
3901 GGGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGGGTCA
CCCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCT^ 

AlalleLeuSerSerLeuThrValThrGlnte^ 
3961 CTGCCATACTCAGCAGCCTX^ACTOTAACCCAGCTCC^ 

GACGGTATGAGIX^TCGGAGTCACAITGGGTCGAGGACTCCGCTC 

SerGluCysThrThrProCysSerGlySerTrp^ 
4021 GCTOGGAGTGTACCACTCCATGCTCOGGTTCCTGGCTAAGGGACATCTGGGACTG 
CGAGCCTCACATGGTGAGGTACX5AGGCCAAGGACOGATTCCCTGTAGACC 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly
4081 GCGAGGTGTTGAGCGACTTTAAGACCTO^ 

CGCTCCACAACTOGCTGAAATTCTGGACCGATTTl^ 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlyIleMet
4141 GGATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCA
CCTAGGGGAAACACAGGAOGGTCGOGCCCATATTOGCCCAGAG^ 

HisThrArgCysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg
4201 TGCACACTCGCTCCCACTGTGGAGCTGAGATCACTG^ACATGTCAA 
ACGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTT^ 

IleValGlyProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyr
4261 GGATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTC^ 

ThrThrGlyProCysThrPrc^uPrcAlaProAs^ 
4321 ACACCATOGGCCCCTGTACCCCCCT^^ 

TGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGC 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMet
4381 TGTCTOCAGAGGAATATGIX^AGATAAGGCAGG^^ 

ACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCC^ 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGluLeu
4441 TGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCC^ 

ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTC 

AspGlyValArgLeuHisArgPheAlaProProCysLysProLeuLeuArgGluGluVal
4501 TGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTO 

ACCTGCCCCAOGOGGATGTATCCAAAOGCGGGGGGACGTTOGGGAAOGAOGOCC 

SerPheArgValGlyLeuHisGluTyrProValGlySerGlnLeuProCys
4561 TATCATTCAGAGTAGGACTCCACXAATACCCGGTAGGGTCGCAATTA 
ATAGTAAGTCTCATCCTGAGGTGCTTATGGGCCAT^^ 

ProAspValAlaVa]J>2uThrSerMetI^u^ 
4621 AACQ^ACGTCGCCGTGTTGACGTCCATGCTCACTGATCCC^^ 
TTGGCCTGCACCGGCACAACTGCAGGlftCGAGTGACT^ 

AlaGlyArgArgLeuAlaArgGlySerProProSerValAlaSerSerSerAlaSerGln
4681 OGGC0GGGCGAAGGTTGGOGAGGGGATCACCCCCCTCTC 
GCOGGCOC^CTTCauUX^CTCOOC^^ 

LeuSerAlaProSerLeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGlu
4741 AGCTATCQ3CTCCATCTCTCAAGGCAACTTGCACCGCT 

TQ5ATAGGCGAGGTAGAGAGTTCCGTTGAACG TGGCG ATTCGTACTGAGGGGACTACGAC 

LeuIleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsnIleThrArgValGlu
4801 AGCTCATAGAGGCCAACCTCCTATGGAGGCA^ 
TOGAGTATCTCC^TTGGAGC^TACCTCrc 

SerGluAsnLysValValIleLeuAspSerPheAspProLeuValAlaGluGluAspGlu
4861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGATG
TCAGTCTTTTGTlTrCACCACTAAGACCTCAGGAAGCTAGGCGAACA 
ArgGluIleSerValProAlaGluIleLeuArgLysSerArgArgPheAlaGlnAlaLeu
4921 AGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCC 
TCGCCCTCTAGAGGCATGGGOTTCTTTAGGACGCCTTCAGAGCCTCTAAGCGOT 



4981 



ProValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAsp 
TGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTO 

TyrGluProProValValHisGlyCysProLeuProProProLysSerProProValPro
5041 ACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTC

TGATGCTTGGTGGACACCAGGTACOGACAGGOGAAGGTGGAGGTTTCAGGGGA 

ProProArgLysLysArgThi^alVal^^ 
5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCT 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGA 

GluLeuAlaThrArgSerPheGlySerSerSerThrSerGlyIleThrGlyAspAsnThr
CCG AGC TCGCCACCAG AAGCTTTGGCAGCTCCTCAACTTCCGG 
GGCTCG AGCGGTGG TCTTCG AAACCGTCG AGGAGTTGAA 

ThrThrSerSerGluProAlaProSerG^ 
CGACAACATCCTCTGAGCCCGCCCCTTC1XMCTGCCCCCCCGACTC 
GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCl^GGCTGCGACTCAGGA 

SerSerMetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 
ATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCAT 
TAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

SerThrValSerSerGluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSer 
GGTCAACGGTCAGTAGTCAGGCCAACGCGGAGGATGTCGTGTGCTC 
CX^GTTGCCAGTCATCACTCCGGTlX^C^CTrc 

^^pThrGlyMaLeuVaXThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAla 
CTTGGACAGGCG<^CTCGTCACCCCGTGCGCCGCGGAA 

GAACCTGTCCGCXSTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCra 

^»^^ erAsnSer ^^^9HisHisAsnLeuVaXTyrSerThrThrSerAM 
^CTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTC 
GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACAT 

ceo, ^J^ 011 ^* 310 ^ 

5521 CTTGCCAAAGGCAGAAGAAAGTttCATTO3ACAGA^ 
GAACGGTTTCCGTCTTCTTTCAGTGTAAACTGTCTC 

AspValLeuLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerVal
AGGACGTACTCAAGGAGGTTAAAGCAGCX^CGTCAAAAGTGAAGGCTAACTTGCTA 
TCCTGCATGAGTTCCTCCAATTTCGTCG<X 

GluGluAlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAla
TAGAGGAAGCTTGC&GCCTGACGCCCCCACACTCAG^^ 
ATCTCCTTCGAACGTO^CTGaa^ 

L y^ s P v ^lArgC^sHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAsp 
5/01 CAJ^AOT r 

GTTTTCTGCAGGCAACGGTACGGTCTTT^ 

LeuLeuGluAspAsnValThrProIleAspThrThrIleMetAlaLysAsnGluValPhe
5761 ACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAAC^ 
TGGAAGACCTTCTCITACATTGT^TTATCTGTGATGGT^ 

CysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeu
5821 TCTGCGTTCAGCCTGAGAAGGGGGGTTCTAAGCCAC^ 

AGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGT^ 

GlyValArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAla 
5881 TGGGCGTGCGCGTGTGCGAAAAGATGGCTTTCTACGACGTGGTTACAAAGC
ACCCGCACGCGCACACGCTTTTCTACOGAAACATGCTG 

ValMetGlySerSerTyrGlyPheGlnTyr^ 
5941 CCGTGATGGGAAGCTCCTACGGATTCC^

GGCACTACCCTTCGAGGATGCCTAAGGTT^^ 

GlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCys
6001 TGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCC^ 
ACG1TCG<^CCTTCAGGTTCTTTTG 

SerThrValThrGluSerAspIleArg^ 
6061 ACTCCACAGTCACTGAGAGOGACATCXXSTAOGGAGGAGGCAATCT 
TGAGGTGTCAGTGACTCTCGCTCTAGGCATGCCTCCTCC^ 

AspProGlnAlaArgValAlaIleLysSerLeuThrGluArgLeuTyrValGlyGlyPro
6121 TCGACCCCCAAGCCCX5CGTGGCCATCAAGTCXX2TCACCGAGAGGCTTTATC 
AGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGG 

LeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeu 
6181 CTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGC^ 
GAGAATGGTTAAGTTCCCCCCTCTTGACGCOGA1AGCGTCCAOGGCGCG 

ThrThrSerCysGlyAsnThrLeuThrCysTyrIleLysAlaArgAlaAlaCysArgAla
6241 TGACAACTAGCTCTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAG 
ACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTC 

AlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuValValIleCysGlu
6301 CCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGC 

GGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATC^ 

SerAlaGlyValGlnGluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArg
6361 AAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGAC^ 
TTTCGCGCCCCCAGGTCCTCC^^ 

TyrSerAlaProProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSer
6421 GGTACTCCGCCCCCCCTX^^CCC^^ 

CCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTO 

CysSerSerAsnValSerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThr
6481 CATGCTCCTCCAACGTGTCAGTCGCCCACGAOGGCGCT 

GTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACC^^ 

ArgAspProThrThrProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProVal
6541 CCCGTGACCCTACAACCCCCCTCGOGAGAGCTCCGTTC 

GGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTC 

AsnSerTrpLeuGlyAsnIleIleMetPheAlaProThrLeuTrpAlaArgMetIleLeu
6601 TCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCAC^ 

AGTTAAGGACCGATCOGTTGTATTAGTACAAACGGGGGTGTGACACCCGCT^ 

MetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCys 

6661 TGATCACCCATTTCTTTAGCGTC^ 

ACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGM 

GluIleTyrGlyAlaCysTyrSerIleGluProLeuAspLeuProProIleIleGlnArg
6721 GCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCA 

CGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTT 



Leu 

6781 GACTC 
CTGAG 
FIG. 47-1 COMBINED ORF OF DNAs k9-1 through 15e



GlyCysProGluArgLeuAlaSerCysArgProLeuThrAspPheAspGlnGlyTrpGly 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlyIleValProAlaLysSerValCysGlyProValTyrCysPheThr
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCA
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACA

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTC 

AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro
361 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgLeuTrpHisTyrProCysThrIleAsnTyrThrIlePheLysIleArg
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 

MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu
541 GGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC

ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThr
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCTCTCCCTGTCCAG^ 

GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

HisLeuHisGlnAsnIleValAspValGlnTyrLeuTyrGlyValGlySerSerIleAla
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
AGGTGGAGGTGGTCTTGTAACACCTCCACGTCATGAACATGCCC^ 

SerTrpMalleLysTrpGluTyrValVal^ 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGC^GAGGA 

ValCysSerCysLeuTr^ 
841 GCXSTCTGCTCCTGeirrGIX^AtGATG
CG<^GAd3A<&ACGAA^ 

LeuValIleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuVal
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 
961 TGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT
ACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA

TvrGlvMetTrpPioI«ulAUl*uLeuI^uI^ 
1021 TCTACGGGATCTTCCCTCTCCTCCTGCTCCra 

aStccccta^c^gaggacmacgagga^ 

AspThrGluVamaAlaSerCysGlyGlyValVall^uV^ 



1081 

AO 



1141 



1201 TG 



l*uSerProTyrTyrLysArgTyrlleSerTrp^ 
CT^TCACCATATTACAAGCGCTATATCAGCTGGTC 

GJ^^GT^TATAATGTTCGCGATATAGTCGA<XACGAA<^CCAC0GAAGTCATAAAA6 

tgaccmagtggaag^ 

ACTWTCTCACCTTOGCGTTSAOGTGCACAOCTARfiGGGGGGASTTGCAGGCTCCCCOOG 
AspAlaValIleLeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys

1261 SSSSSSSS^S 

LeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAlaSerLeuLeuLysValPro

1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTC 

TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG 

TyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAlaArgLysMetIleGly
1381 CCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCG
GGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGC 

GlyHisTyrValGlnMetValIleIleLysLeuGlyAlaLeuThrGlyThrTyrValTyr
1441 GAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTT

CTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAA 

AsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArgAspLeuAlaValAla

1501 ATAACCATCTCACTCCTCTTCGGGACTGGGCGCAC^^ 

TATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC 

ValGluProValValPheSerGlnMetGluThrLysLeuIleThrTrpGlyAlaAspThr
1561 CTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCAOGTGGQGGGCAGATA 
GA^TCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCC^ 

AlaAlaCysGlyAspIleIleAsnGlyLeuProValSerAlaArgArgGlyArgGluIle
1621 CCGCCGGGTGCGGTGACAXCATCAACGGCTTGCCTGTTTCCGCCCGCAG^ 
GG^GCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGC^ 

LeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThr
1681 TACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCA
ATGMGAGCCCGGTCGCTC 

AlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIleIleThrSerLeuThrGlyArg

1741 ss^^ 

i. AspLysAsnGlnValGluGlyGluValGlnl^^ 
1801 GGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTCTGTCAACTG^ 
COCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACAC^ 




AlaSerProLysGlyProValIleGlnMetTyrThrAsnValAspGlnAspLeuValGly
1921 TCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGG
AGCGGAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACC 

TrpProAlaProGlnGlySerArgSerLeuThrProCysThrCysGlySerSerAspLeu 
1981 GCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACC 
CGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGG 

TyrLeuValThrArgHisAlaAspValIleProValArgArgArgGlyAspSerArgGly
2041 TTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGG
AAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC

SerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeu 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTC 

CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyIlePheArgAlaAlaValCysThrArgGlyValAla
2161 TGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGG
ACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACC 

LysAlaValAspPheIleProValGluAsnLeuGluThrThrMetArgSerProValPhe
2221 CTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGT 
GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

ThrAspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAla 
2281 TCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATG
AGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC 

ProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLys
2341 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATA
GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT

ValLeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLys 
2401 AGGTCCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCA 
TCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGT 

AlaHisGlyIleAspProAsnIleArgThrGlyValArgThrIleThrThrGlySerPro
2461 AGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCC 
TCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGG 

IleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
2521 CCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTT
GGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCC 

AspIleIleIleCysAspGluCysHisSerThrAspAlaThrSerIleLeuGlyIleGly
2581 ATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCG
TACTGTATTATTAAACACTGCTCAQGGTGAGGTGTC 

ThrValLeuAspGlnAlaGluThrAlaGlyAla^ 
2641 GCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTC^ 
CGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGA^ 

ProProGlySeiValThxValProH^^ 
2701 (XCX^CGGGCTCXXm^CTGTGC<XCATCCCA^ 

GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTC 

GlyGluIleProPheTyrGlyLysAlaIleProLeuGluValIleLysGlyGlyArgHis
2761 COGGAGAGATCCCTTTTTACGGCAAGGCTATCCCC^ 

GGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTT^ 

LeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeu
v - 2821, ATCSPCATCT^^ 

^ • ; ; , : ^ 

; ■ \GlyIleAfi 

2881 TCGGCATCAATGCCGTGGCCTACTACCGCGGTCTTG ACGTGTCCG 
ACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTC 
AspValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSer
2941 GCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACT 
GGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGA 

ValIleAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPhe
3001 CGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGA 

ThrIleGluThrIleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg
3061 TCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCA
AGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGT

ThrGlyArgGlyLysProGlyIleTyrArgPheValAlaProGlyGluArgProSerGly
3121 GGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
3181 GCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGC
CGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal
3241 TCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG
AGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGC

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
3301 TGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATC 
ACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln
3361 CCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 
GGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGG 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
3421 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT
TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla
3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTCGGCG 
CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGAT 

ValGlnAsnGluIleThrLeuThrHisProValThrLysTyrIleMetThr<^sMetS^ 
3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGAC
GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTG

AlaAspLeuGluValValThrSerThrTrpValLeuValGlyGly
3601 CGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGC
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCG

AlaAlaTyrCysLeuSerThrGlyCysValValIleValGlyArgValValLeuS
3661 TGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTC
ACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAG

LysProAlaIleIleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGlu
3721 GGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGA
CCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCT

CysSerGlnHisLeuProTyrIleGluGlnGlyMetMetLeuAlaGluGlnPhe
3781 AGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAA
TCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACG

: ; ^ Itf sAlaLeuGiyi^^ 
3841: ?;AGAAGGCCCttX^CTC^ 

GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPheIleSer
3901 TCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCA
AGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGT

GlyIleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlaIleAlaSerLeu
3961 GTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCAT 
CACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTA 

MetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn
4021 TGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCA
ACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGT

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal
4081 ACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTG
TGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAAC

GlyAlaGlyLeuAlaGlyAlaAlaIleGlySerValGlyLeuGlyLysValLeuIleAsp
4141 TCGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAG
AGCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATC

IleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer
4201 ACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTCGCATTCAAGATCATGA
TGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACAGCGTAAGTTCTAGTACT

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlaIleLeuSerProGly
4261 GCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCG 
CGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AlaLeuValValGlyValValCysAlaAlaIleLeuArgArgHisValGlyProGlyGlu
4321 GAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG
CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGC

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer
4381 AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTT
TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAA

ProThrHisTyrValProGluSerAspAlaAlaAlaArgValThrAlaIleLeuSerSer
4441 CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA
GGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGT

4501 GCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGT
CGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCA

ProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeuSerAsp
4561 CTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGG
GAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCC

PheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlyIleProPheValSer
4621 ACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGT
TGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACA

CysGlnArgGlyTyrLysGlyValTrpArgValAspGlyIleMet
4681 CCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTCCC
GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCG

4741 ACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATCAGGATCGTCGGTCCTA
TGACACCTCGACTCTAGTGACCTGTAC

4801 ACCTGCAGGAACATGTGGAGT

ThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyr
4861 GTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAAT
CATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA
ValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeu
4921 ATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATC 
TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 

LysCysProCysGlnValProSerProGluPhePheThrGluLeuAspGlyValArgLeu 
4981 TCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCC
AGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGG

HisArgPheAlaProProCysLysProLeuLeuArgGluGluValSerPheArgValGly
5041 TACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAG 
ATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATC 

LeuHisGluTyrProValGlySerGlnLeuProCysGluProGluProAspValAla
5101 GACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGC
CTGAGGTGCTTATGGGCCATCCCAGCGTT

LeuThrSerMetLeuThrAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeu 
5161 TGTTGACGTCGATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGT
ACAACTGCAGCTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA

AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSer 
5221 TGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCAT 
ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA

LeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsn 
5281 CTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCA 
GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGT

LeuLeuTrpArgGlnGluMetGlyGlyAsnIleThrArgValGluSerGluAsnLysVal
5341 ACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAG 
TGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTC 

ValIleLeuAspSerPheAspProLeuValAlaGluGluAspGluArgGluIleSerVal
5401 TGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCG
ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC

ProAlaGluIleLeuArgLysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArg
5461 TACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGC
ATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG

ProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyrGluProProVal 
5521 GGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTG 
CCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGAC

ValHisGlyCysProLeuProProProLysSerProProValProProProArgLysLys 
5581 TGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGC
ACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCG

ArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAlaGluLeuAla
5641 AGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCC
TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGG

SerPheGlySerSerSerThrSerGlyIleThrGlyAspAsnThrThrThrSerSer
5701 GAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTC
CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATG

ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
5761 AGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCC
TCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGG

LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer
5821 CCC

GGGACCTCqCCQ?^ 

GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeu
5881 GTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCAC
CACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTG 

ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeu 
5941 TCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGT
AGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCA 

LeuArgHisHisAsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys
6001 TGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAA
ACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTT

LysValThrPheAspArgLeuGlnValLeuAspSerHisTyrGlnAspValLeuLysGlu
6061 AGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGG
TCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCC

ValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSer 
6121 AGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCA 
TCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGT 

LeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAspValArgCys 
6181 GCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTT
CGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAA

HisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeuGluAspAsn 
6241 GCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA
CGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGT 

ValThrProIleAspThrThrIleMetAlaLysAsnGluValPheCysValGlnProGlu
6301 ATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTG 
TACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGAC 

LysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyValArgValCys 
6361 AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGT 
TCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACA 

GluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMetGlySerSer 
6421 GCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCT
CGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGA

TyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSer
6481 CCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGT
GGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCA

LysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSer
6541 CCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCAC
GGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTG

SerAspIleArgThrGluGluAlaIleTyrGlnCysCysAspLeuAspProGlnAlaArg
6601 AGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCC
TCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGG

ValAlaIleLysSerLeuThrGluArgLeuTyrValGlyGlyProLeuThrAsnSerArg
6661 GCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAA
CGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTT

GlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCysGly
6721 GGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG
CCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACAC

AsnThrLeuThrCysTyrIleLysAla
6781 GT^CACCdTCACTTGCTA<^TGAAGGCCCGGGC^GCCTG.TG^^ 

6841 ACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCC
TGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGG
GluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProPro
6901 AGGAGGACGCGGCGAGCCTGAGJ^CTTCAC^^ 
TCCTCCTGCGCCGCTCGGA

GlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSerC
6961 CTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACAT
GACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTA

SerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThr
7021 TGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCAC
ACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTG

ProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuG
7081 CCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCT
GGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGA

AsnIleIleMetPheAlaProThrLeuTrpAlaArgMetIleLeuMetThrHisPhePhe
7141 GCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATG
CGTTGTATTAGTACAAACGGGGGTGTGACACCCGCTCCTAC

SerValLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAla
7201 TTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGG
AATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCC

CysTyrSerIleGluProLeuAspLeuProProIleIleGlnArgLeu
7261 CCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCATTCAAAGACTC
GGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG